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Abstract 

In classical scheduling problems, we arc given jobs and machines, and have to schedule all the jobs 
to minimize some objective function. What if each job has a specified profit, and we are no longer 
required to process all jobs — we can schedule any subset of jobs whose total profit is at least a (hard) 
target profit requirement, while still approximately minimizing the objective function? 

We refer to this class of problems as scheduling with outliers. This model was initiated by Charikar and 
KhuUer (SODA'06) on the minimum max-response time in broadcast scheduling. In this paper, we 
consider three other well-studied scheduling objectives: the generalized assignment problem, average 
weighted completion time, and average flow time, and provide LP-based approximation algorithms 
for them. Our main results are: 

• For the minimum average flow time problem on identical machines, we give a logarithmic 
approximation algorithm for the case of unit profits based on rounding an LP relaxation; we 
also show a matching integrality gap. While the LP relaxation has been used before, the 
rounding algorithm is a delicate one. 

• For the average weighted completion time problem on unrelated machines, we give a constant- 
factor approximation. The algorithm is based on randomized rounding of the time-indexed LP 
relaxation strengthened by the knapsack-cover inequalities. 

• For the generalized assignment problem with outliers, we give a simple reduction to GAP with- 
out outliers to obtain an algorithm whose makespan is within 3 times the optimum makespan, 
and whose cost is at most (1 -|- e) times the optimal cost. 
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1 Introduction 



In classical scheduling problems, we are given jobs and machines, and have to schedule all the jobs to 
minimize some objective function. What if we are given a (hard) profit constraint, and merely want 
to schedule a "profitable" subset of jobs? In this paper, we consider three widely studied scheduling 
objectives — makespan, weighted average completion time, and average flow-time — and give approxi- 
mation algorithms for these objectives in this model of scheduling with outliers. 

Formally, the scheduling with outliers model is as follows: given an instance of some classical scheduling 
problem, imagine each job j also comes with a certain profit ttj. Given a target profit 11, the goal is now 
to pick a subset of jobs S whose total profit '^j^s least 11, and to schedule them to minimize the 

underlying objective function. (Equivalently, we could define the "budget" B = ttj — 11, and discard 
a subset of "outlier" jobs whose total profit is at most B.) Note that this model introduces two different 
sources of computational difficulty: on one hand, the task of choosing a set of jobs to achieve the profit 
threshold captures the knapsack problem; on the other hand, the underlying scheduling problem may 
itself be an intractable problem. 

The goal of picking some subset of jobs to process as efficiently as possible, so that we attain a minimum 
level of profit or "happiness" , is a natural one. In fact, various problems of scheduling with job rejections 
have been studied previously: a common approach, studied by Bartal et al. has been to study "prize- 
collecting" scheduling problems (see, e.g., ^, 11, ^), where we attempt to minimize the scheduling 



objective plus the total profit of unscheduled jobs. One drawback of this prize-collecting approach is 
that we lose fine-grained control on the individual quantities — the scheduling cost, and the lost profit — 
since we naively sum up these two essentially incomparable quantities. In fact, this makes our model 
(with a hard target constraint) interesting also from a technical standpoint: while we can reduce the 
prize-collecting problem to the target profit problem by guessing the lost profit in the optimal prize- 
collecting solution, reductions in the opposite direction are known only for a handful of problems with 



very restrictive structure (see Section 1.2 for a discussion). 

To the best of our knowledge, the model we investigate was introduced by Charikar and Khuller Q, 
who considered the problem of minimizing the maximum response time in the context of broadcast 
scheduling; one of our results is to resolve an open problem from their paper. Scheduling problems with 
outliers were also implicitly raised in the context of model-based optimization with budgeted probes: 
Guha and Munagala |0] gave an LP-based algorithm for completion-time scheduling with outliers 
which violated budgets by a constant factor — we resolve an open problem in their paper by avoiding 
any violation of the budgets. 

1.1 Our results 

GAP and makespan. As a warm-up, we study the Generalized Assignment Problem, a generalization 



of the makespan minimization problem on unrelated machines, in Section 2. For this problem, we give 
a simple reduction to the non-outlier version of this problem to get a solution approximating the 
makespan and cost by factors of 3 and (1 -|- e) respectively. Recall that the best non-outlier guarantee 



is a 2-approximation |29] without violating the cost — however, it is easy to show that in the presence 



of outliers the (1 + e) loss in cost are unavoidable unless P = NP. 

Average completion time. We then consider the problem of minimizing the sum of weighted com- 
pletion times on unrelated machines with release dates in Section ^ . 

Theorem 1.1. For i?|rj, outliers | ^jWjCj, there is a randomized 0{1)- approximation algorithm. 



Our algorithm is based on approximately solving the time- indexed LP relaxation of Schulz and Skutella |27| 
strengthened with knap sack- cover inequalities followed by randomized rounding. We improve on this 
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result to obtain an FPTAS for unweighted sum of completion times on a constant number of machines. 
(The best non-outlier upper bound for Rl'fjl'^jWjCj is a 2-approximation due to Skutella |3C]; this 
problem is also known to be APX-hard 



Average flow time. This is the technical heart of the paper. The problem is to minimize the average 
(preemptive) flow time on identical machines i-*|rj,pmtn, outlier s| ^Fj. Our main result is: 

Theorem 1.2. For P | r^, pmtn, outliers | ^ -Fj, when all jobs have unit profits, there is an 0{logP)- 
approximation algorithm, where P is the ratio between the largest and smallest processing times. 

This comes close to matching the best known result of 0(log min{P, n/m}) for the non-outlier version 



due to Leonardi and Raz |23]. However, this problem seems to be much harder with outliers, as we get 
the same approximation even on a single machine, in contrast to the non-outlier single-machine case 
(which can be solved optimally). We show our approach is tight, as the LP relaxation we use has an 
il(logP) integrality gap. 



The algorithm rounds a linear-programming relaxation originally suggested in |13]; however, we need 



new ideas for the rounding algorithm over those used by [13|. At a high-level, here is the idea behind our 



rounding algorithm: the LP might have scheduled each job to a certain fractional amount, and hence 
we try to swap "mass" between jobs of near-equal processing times in order to integrally schedule a 
profitable subset of jobs. However, this swapping operation is a delicate one, and merely swapping mass 
locally between nearby jobs has a bad algorithmic gap. Furthermore, we need to handle jobs that are 
only approximately equal in size, which leads to additional difficulties. (For a more detailed high-level 
sketch of these issues, please read Section O .) 



1.2 Related work 

Scheduling with rejections. As mentioned above, previous papers on this topic considered the "prize- 
collecting" version which minimizes the scheduling objective plus the total profit of unscheduled jobs; 
their techniques do not seem to extend to scheduling with outliers, in which we have a strict budget on the 
total penalty of rejected jobs. Bartal et al. considered offline and online makespan minimization and 
gave best-possible algorithms for both cases. Makespan minimization with preemptions was investigated 



by [|l8| , 28|. Epstein et al. [1^ examined scheduling unit-length jobs. Engels et al. studied the prize- 
collecting version of weighted completion-time minimization (on single or parallel machines), and gave 
PTASs or constant-factor approximations for these problems; they also proposed a general framework 
for designing algorithms for such problems. 

Outlier versions of other problems. Also called partial- covering problems, these have been widely 
studied: e.g., the fc-MST problem |12], the /c-center and facility location problem Q and the A;- median 



problem with outliers partial vertex cover (e.g., |25| and references therein) and fe-multicut |15, 24|. 



Chudak et al. distilled ideas of Jain and Vazirani 1 1£ ] on converting "Lagrange- multiplier preserving" 



algorithms for prize-collecting Steiner tree into one for fc-MST; Konemann et al. [^T| gave a general 
framework to convert prize-collecting algorithms into algorithms for outlier versions (see also [^]). We 
cannot use these results, since it is not clear how to make the algorithms for prize-collecting scheduling 
problems to also be Langrange-multiplier preserving, or whether the above-mentioned framework is 
applicable in scheduling-related scenarios. 

2 GAP and Makespan 

As a warm-up, we consider the generalized assignment problem, which is an extension of minimizing 
makespan on unrelated machines with outliers. Formally, the instance Z has m machines and n jobs. 
Each job j has a processing time of pij on machine i, an assignment cost of Cjj, and a profit of vTj. Given 
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a profit requirement IT, cost bound C and makespan bound T, the goal is to obtain a feasible schedule 
satisfying these requirements (or to declare infeasibility) . Of course, since the problem is NP-hard, we 
look at finding solutions where we violate the cost and makespan bounds, but not the (hard) profit 
requirement. We now show how to reduce this problem to the non-outlier version studied earlier, while 
incurring small additional losses in the approximation guarantees. 

Theorem 2.1. Given an instance I of GAP-with-outliers with optimal cost C , and makespan T , there 
is a polynomial time algorithm to output an assignment with cost (1 + e)C and makespan 3T. 

Proof. Given the instance I, construct the following instance I' of the standard GAP (where there 
are no profits or outliers). There are m + 1 machines: machines 1,2, ... ,m are the same as those in I, 
while machine m + 1 is a "virtual profit machine". We have n jobs, where job j has a processing time 
of pij and an assignment cost of Cij when scheduled on machine i (for 1 < i < m). If job j is scheduled 
on the virtual machine m + 1, it incurs a processing time of ttj and cost zero: i.e., P(m+i)j = ^-nd 
C(m+i)j = 0- For this instance I', we set a cost bound of C, makespan bound of T for all machines 
1 < i < m, and a makespan bound of Typm '■= (X^j=i ^Tj) — n for the virtual profit machine. Note that 
any feasible solution for 2 is also feasible for I' , with the outliers being scheduled on the virtual profit 
machine, since the total profit of the outliers is at most T^ipm = {Yl^=i'^j) ~ ^■ 

We can now use the algorithm of Shmoys and Tardos ||2^ which guarantees an assignment S for the 
GAP instance I' with the following properties: (a) The cost of assignment S is at most C, (h) the 
makespan induced by S on machine i (for 1 <i < m) is at most T + mm{max j pij,T}, and (c) The 
makespan of S on the virtual machine m + 1 is at most ((X^[Li ^i) — H) + maxj ttj. 

Note that this assignment S is almost feasible for the outlier problem I — the makespan on any real 
machine is at most T + maxj pij , the assignment cost is at most C — however, the profit of the scheduled 
jobs is only guaranteed to be at least 11 — maxj ttj. But it is easy to fix this shortcoming: we choose a 
job j' assigned by S to the virtual machine which has the largest profit, and schedule j' on the machine 
where it has the least processing time. Now the modified assignment has cost at most C + maxj^/ Cij', 
makespan at most T + 2 minjmaxj pjj, T}, and the total profit of the scheduled jobs is at least 11. (We 
assume that any job j where miniPij > T has already been discarded.) This is almost what we want, 
apart from the cost guarantee. So suppose we "guess" the 1/e most expensive assignments in OPT (in 
time 0(m?i^/'^)), and hence we can focus only on the jobs having Cij < eC for all possible remaining 
assignments. Now the cost of the assignment is C + maxjj/ Cjj/ < C(l + e), and the makespan is at most 
3T. This completes the proof. ■ 

In fact, the (1 + e) loss in cost is inevitable since we can reduce the knapsack problem to the single 
machine makespan minimization with outliers problem: values of items become profits of jobs, and their 
weights become the assignment cost; the weight budget is the cost budget, and the required value is the 
required profit. As for the makespan guarantee, the 3/2-hardness of Lenstra et al. carries over. 

3 Weighted Sum of Completion Times 

We now turn our attention to average completion time — in particular, to i?|rj, outlier s| wjCj. The 
main result of this section is a constant factor approximation for this problem. Not surprisingly, the 
integrality gap of standard LP relaxations is large[|, and hence we strengthen the time-indexed formula- 



tion with the so-called knapsack-cover inequalities 31 ] . We show that a randomized rounding scheme 



similar to that of Schulz and Skutella |27| gives us the claimed guarantees on the objective function, 
and while preserving the profit requirements with constant probability. 



^Implicit in the work of Guha and Munagala is an algorithm which violates the profit requirement by a constant 
factor; they also comment on the integrality gap, and pose the problem of avoiding this violation. 
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3.1 A Constant Approximation for Weighted Sum of Completion Times 

We have a collection of m machines and n jobs, where each job j is associated with a profit ttj, a weight 
Wj, and a release date rj. When job j is scheduled on machine i, it incurs a processing time of pij. 
Given a parameter 11 > 0, the objective is to identify a set of jobs S and a feasible schedule such that 
X^jgS^i — ^ such that '^j^s'^i^i minimized. Here, Cj denotes the completion time of job j. 

3.1.1 A Time Indexed LP Relaxation 

For the non-outlier version, in which all jobs have to be scheduled, Schulz and Skutella gave a 
constant factor approximation by making use of a time-indexed LP. We first describe a natural extension 
of their linear program to the outlier case, while also strengthening it. 

Vj 

Vj 
Vi,t 

: n(^) < n 

yi,j,t:t< rj 

In this formulation, the variable Xijt stands for the fractional amount of time machine i spends on 
processing job j in the time interval [t,t + 1); note that the LP schedule may be preemptive. The 
variable Cj, defined by constraint (1), is a measure for the completion time of job j. In any integral 
solution, where job j is scheduled from t to t + Pij on a single machine i, it is not difficult to verify 
that Cj evaluates to t + pij. The variable defined by constraint (2), is the fraction of job j being 
scheduled. Constraint (3) ensures that machine i spends at most one unit of processing time in [t, i -|- 1). 
Constraints (5) and (6) are additional feasibility checks. 

We first observe that replacing the set of constraints (4) by a single inequality of the form X]j=i '^iVj — ^ 
would result in an unbounded integrality gap - consider a single job of profit M, and 11 = 1; the LP 
can schedule a 1/M fraction of the job, incurring a cost which is only 1/M times the optimum. We 
therefore add in the family of constraints (4), known as the knapsack- cover (KG) inequalities. Let A be 
any set of jobs, and let n(^) = "^j^zj^iTj be the sum of profits over all jobs in A. Then, [11 — n(^)] + 
is the profit that needs to be collected by jobs not in A when all jobs in A are scheduled. Further, if 
A does not fully satisfy the profit requirement, any job j ^ A has a marginal contribution of at most 
TTj^ = min{7rj,n — n(^)}. Therefore, for every set A such that n(^) < H, we add a constraint of the 
form Ylj^A^f'yi — ^ — n(^). Note that there are exponentially many such constraints, and hence we 
cannot naively solve this LP. 

"Solving" the LP. We will not look to find an optimal solution to the above LP; for our purposes, it 
suffices to compute a solution vector (ir , y, C) satisfying the following: 

(a) Constraints (l)-(3) and (5)-(6) are satisfied. 

(b) Constraint (4) is satisfied for the single set {j : yj > 1/2}. 

(c) J2J=i "^j^j — 2 ■ Opti where Opt denotes the cost of an optimal integral solution. 

We compute this solution vector by first guessing Opt up to a multiplicative factor of 2 (call the guess 
Opt), and add to the LP the explicit constraint Yl^=i'^j^j — Op*- Then, we solve the LP using the 
ellipsoid algorithm. For the separation oracle, in each iteration, we check if the current solution satisfies 
properties (a)-(c) above. If none of these properties is violated, we are done; otherwise, we have a 



minimize J2j=i'^jC^j 

subject to (1) c,=Er=iEr=o(s(*+^)+^) 

(2) y, = ET=iEloTS- 

(3) Ej=i ^ijt < 1 

(4) E,-^^^/%>n-n(^) 

(5) Xijt = 

(6) Xijt > 0, < y,- < 1 
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violated constraint. We now present our rounding algorithm (in Algorithm 1), based on the non-outlier 
algorithm of |27]. 



Algorithm 1 Weighted Sum of Completion Times 

1: given a solution vector {x,y, C) satisfying properties (a)-(c), let A* be the set {j \ yj > 1/2}. 
2: for each job j, do the following steps 

2a: if j G A*, for each {i,t) pair, set kjt = Xijt/ {pijVj). Note that for such jobs j ^ A* , we have 

YllLi Z]t=o ~ from constraint (2) of the LP. 
2b: if j ^ A*, set lijt = 2xijt/pij. In this case, note that YllLi J2t=o^ijt ~ '^Vj- 
2c: partition the interval [0, 1] in the following way: assign each (i, t) pair a sub-interval la of 
[0, 1] of length lijt such that these sub-intervals are pairwise disjoint. Then choose a uniformly 
random number r € [0, 1] and set Tj to be the pair s.t r G In. If there is no such 
pair, leave j unmarked. 

3: for each machine i, consider the jobs such that Tj = (i,*); order them in increasing order of their 
marked times; schedule them as early as possible (subject to the release dates) in this order. 



3.2 Analysis 

We now show that the expected weighted sum of completion times is O(l)0pt, and also that with 
constant probability, the total profit of the jobs scheduled is at least 11. 

Lemma 3.1. The expected weighted sum of completion times is at most 16 • Opt. 

Proof. Let (7^ be a random variable, standing for the completion time of job j; if this job has not 

been scheduled, we set Cj^ = 0. Since Yli'^j=iWjCj < 2 • Opt, it is sufficient to prove that E[Cj^] < 8Cj 
for every j. To this end, note that 

i=l t=0 i=l t=0 

where the last inequality holds since Pr[Tj = (i,t)] = hjt < 2xijt/pij, regardless of whether j € A* or 
not. Now let us upper bound E[C^|tj- = {i,t)]. The total time for which job j must wait before being 
processed on machine i can be split in the worst case into: (a) the idle time on this machine before 
j is processed, and (b) the total processing time of other jobs marked {i,t') with t' < t. If job j has 
been marked (i,t), the idle time on machine i before j is processed is at most t. In addition, the total 
expected processing time mentioned in item (b) is at most 

t t 

Y^p.k E = (», t')\T, ^ {i, t)] = Ep^fc E = ^')] 

k^j t'=a k^j t'=o 

^Ep-E^ = 2EE^-'^2(t+i), 

k^j i'=0 ^''^ t'=Ok=i] 

where the last inequality follows from constraint (3). Combining these observations and constraint (1), 
we have 

E[c^f] <2EE?^(^ + 2(^+i)+p'.)^^8EE f^K^ hT'h 

■ 

Lemma 3.2. The randomized rounding algorithm produces a schedule that meets the profit constraint 
with probability at least 1/5. 



5 



Proof. Clearly, when the jobs in A* collectively satisfy the profit requirement, we are done since the 
algorithm picks every job in this set. In the opposite case, consider the Knapsack Cover inequality for 
A*^ stating that Ylji^A* "^fvi ^ n — n(^*). The total profit collected from these jobs can be lower 
bounded hy Z = X^^-^^* ttj^* Zj, where Zj is a random variable indicating whether job j is picked. 

Since our rounding algorithm picks all jobs in A*, the profit requirement is met if Z is at least H— n(^*). 
To provide an upper bound on the probability that Z falls below IT — n(^*), notice that by the way 
the algorithm marks jobs in Step ^, we have that each job not in A* is marked with probability 2yj, 
independently of the other jobs. Therefore, 

E [z] = E[E,^^. ^fz,] = 2 E,^^. ^fvj > 2(n - n^*)) . 

Consequently, if we define aj = vr^ /(H — n(^*)), then 



Pt[z <n-n{A*)] = Pr 



< Pr 



< exp(-i.E[E,^^.a,Z,])<e-i/4<4 ^ 

where the first and third inequalities hold since E[Ej^_4* Oij^j] ^ 2, and the second inequality follows 
from bounding the lower tail of the sum of independent [0, 1] r.v.s (see, e.g., Thm. 3.5]). ■ 

The above two lemmas combine to give the following theorem. 

Theorem 3.3. For i?|rj, outliers | J2j^jCj) there is a randomized 0(1) -approximation algorithm. 

While the LP formulation as stated has exponentially many time intervals of length 1, we can make our 
algorithm fully polynomial in the size of the input (with a small loss in approximation guarantee) by 
considering geometrically increasing sizes |17] for the time intervals. 



In [Appendix B.l , we show that given K different profit requirements, our algorithm can be modified 



to give an 0(log -R')-approximation. 

3.3 Single Machine, Identical Weights 

In this section, we show how we can get an FPTAS using dynamic programming for the problem of 
minimizing the unweighted sum of completion times on a constant number of machines. For simplicity, 
we first give the complete proof for the case of a single machine, and sketch how to extend it for a 
constant number of machines. 

Single Machine, Identical Weights. We are given a collection of n jobs where job j is associated 
with a processing time pj and a profit iTj. Given a target profit of 11 > 0, the goal is to identify a set of 
jobs 5 and a corresponding single-machine schedule such that YljeS '^i — ^ ^^'^ Sje5 minimized 
(where Cj is the completion time of job j). 

Dynamic program. Suppose pi, . . . ,pn are integers such that pi < ■ ■ ■ < Pn- Let profit(j, C, L) be the 
maximum profit that can be collected by scheduling a subset of jobs {1, . . . , j} such that their sum of 
completion times is at most C and makespan is exactly L. Then, the following recurrence holds: 

profit(j, C, L) = max{profit(j — 1, C, L),TTj + profit(j — l,C — L, L — pj)} 

To better understand the above equation, notice that if job j is picked by an optimal schedule, it will not 
be scheduled before any of the jobs {1, . . . , j — 1} since the shortest processing time strategy is optimal 
for a fixed set of jobs (1|2^). Therefore, consider a set of jobs {1, . . . ,j} that have a bound C on their 
sum of completion times and let L be their makespan. If j is scheduled, the jobs {1, . . . , j — 1} must 
have a residual makespan of L — pj and a bound of C — L on the sum of completion times since job j 
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incurs a completion time of L by virtue of it being scheduled last among {1, . . . , j}; we also collect a 
profit of vTj in this case. On the other hand, if j is not scheduled, C and L remain the same but we don't 
collect any profit. Now, given this recurrence, the goal is to find the minimal C and some L such that 
profit(n, C, L) > H. This can be solved by dynamic programming, with running time 0(nCmaxLmax)- 
Since C < n?Pn and L < npn, the running time is 0{n^p^), i.e. pseudo-polynomial. 

Therefore, the above dynamic program can be used to compute an optimal solution in polynomial time 
when all processing times are small integers. We now apply scaling techniques to obtain an FPTAS to 
handle arbitrary processing times. 

Handling general instances. Given an instance I of the original problem, we begin by "guessing" 
-fmax) the maximum processing time of a job that is scheduled in some fixed optimal solution. We now 
create a new instance 2' in which every job j with pj > Pmax is discarded; other jobs get a scaled 
processing time of p'j = \pj/K~\, where K = i2ePyaa.x) / {n{n + 1)). Notice that the scaled processing 
times of remaining jobs are integers in [0, \n{n + l)/(2e)]]. We can therefore find in 0(n®/e^) time an 
optimal subset of jobs Jx' to be scheduled in T', and return this set as a solution for Z. 

Theorem 3.4. Scheduling the jobs Jx' if^ order of non- decreasing processing times guarantees that their 
sum of completion times is at most (1 + e)Opt(T). 

Proof. We begin by relating Opt(T) to Opt(2r'). For this purpose, suppose that Jx = {ji, . . . , j/j} in 
an optimal solution to Z. Then, 

r=l s=l r=l s=l 

Now suppose that Jx' = {j'l^ • • • > Jq}- Then, the sum of completion times that results from scheduling 
J J , . . . , j'q in this exact order is 

Q r Q r 

T,T.pr. ^ '<T,Ep'f. 

r=l s=l r=l s=l 

= K ■ Opt(X') 

<Op.(z) + !*i±M 

= Opt(T) + ePmax 
< (1 + e)0pt(2:) . 

where the last inequality holds since -fmax is a lower bound on Opt(^). I 
Constant number of machines. 

We finally consider the case when there is a constant number of identical machines, say m. To this end, 
let profit(j, C, Li, L2, . . . , Lm.) be the maximum profit that can be collected by scheduling a subset of 
the jobs 1, . . . , J such that their sum of completion times is at most C and such that the makespan is 
exactly Li on machine i. Then profit(j, C, Li, . . . , Lm) can be written as 



max 



|profit(j - 1, C,Li, . . . ,L„),max(7rj + profit(j - 1,C - U^Li, ...,Li-pj,.. . ,Lm))| 



When m = 0(1), the size of this dynamic program is still polynomial in n. The remaining analysis is 
similar to the one for a single machine. 
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4 Minimizing Average Flow Time on Identical Machines 

Finally, we consider the problem of minimizing the average (preemptive) flow time on identical machines 
{P\r j, pmtn, outlieTs\ ^ Fj) with unit profits. We present an LP rounding algorithm that produces a 
preemptive non-migratory (no job is scheduled on multiple machines) schedule whose flow time is within 
0(log P) of the optimal, where P is the ratio between the largest and smallest processing times. 

This is the technical heart of the paper; in sharp contrast to the problems in the previous two sections, 
it is not clear how to easily change the existing algorithms for this problem to handle the outliers case — 
while we use the same LP as in previous works, our LP rounding algorithm for the outlier case has to 
substantially extend the previous non-outlier rounding algorithm. Since our algorithms are somewhat 
involved, we flrst present the algorithm for a single machine, and subsequently sketch how to extend it 
to multiple identical machines. For the rest of this section, consider the following setup: we are given 
a single machine and a collection of n jobs where each job j has a release date rj E Z and a processing 
time pj £ Z. Given a parameter H > 0, we want to identify a set of jobs S and a preemptive schedule 
minimizing Yljes-^J (where Fj = Cj — rj) subject to l^j > H. 

4.1 The Flow-time LP Relaxation and an Integrality Gap 

Our LP relaxation is a natural outlier extension of one used in earlier flow-time algorithms ([^, 0]). 
We first describe what the variables and constraints correspond to: (i) fj is the fractional flow time of 
job j, (ii) Xjt is the fraction of job j scheduled in the time interval [t,t + 1), and (Hi) yj is the fraction 
of job j scheduled. Constraint (1) keeps track of the flow time of each job, while constraints (2), (3), 
and (4) are to make sure the solution is feasible with respect to the proflt constraint. Notice that in 
constraint (1), we use the quantity pj (which denotes the processing time pj rounded up to the next 
power of 2), instead of pj. Also, this modification is present only in constraint (1) which dictates the LP 
cost, and not in constraint (2) which measures the extent to which each job is scheduled. The quantity 
T is a guess for the time at which the optimal solution completes processing jobs (in fact, any upper 
bound of it would suffice). We also assume that a parameter /c* G Z was guessed in advance, such that 
the optimal solution only schedules jobs with pj < 2'^* . Our algorithm would have running time which 
is polynomial in T and n. 



minimize Sj=i fj 

subject to (1) h = Y.i"f{t + \-r) + "-f) Vj 

(2) PjVj = T,Lo^jt 

(3) EJ=i^ii<l Vt 

(4) E-=i?/.>n 

(5) Xjt = y j, t : t < Vj 

(6) xjt>0, 0<yj<l Vi,i 

Given the above LP, we first claim that it is indeed a relaxation. 

Lemma 4.1 (Relaxation). Opt(LP) < Opt, where Opt denotes the optimal sum of flow times. 

Proof. Given an optimal solution for the given instance, we construct a corresponding LP solution in a 
natural way, by setting Xjt = Apj when the optimal solution schedules a A fraction of job j in the time 
interval [t,t + l). It is easy to verify that the profit constraint is satisfied. Now, consider a particular job 
j scheduled in the optimal solution. We proceed by showing that the term /' = Ylt=oi^(^~^h~^3^^~^^ 
is a lower bound on the fiow time of j (notice that /' has pj in the denominator where fj had pj). 
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Suppose the optimal solution completes processing j at Cj. The flow time is therefore Cj — rj, whereas 
the worst case for the LP is when j is contiguously scheduled in the time interval [Cj —pj, Cj); otherwise, 
some fraction is scheduled earlier, and the contribution to /j can only decrease. Consequently, 



t + 1/2 -rj ^ Pj 



t=Cj-pj 



Pj 



- rj . 



Since fj < f'j, we have fj < Cj — rj. The lemma follows by summing over all jobs scheduled by OPT. ■ 

Before getting into the details of our algorithm, to gain more intuition for this relaxation, we demonstrate 
that it has an integrality gap of 0(log P), where P is the ratio between the largest and smallest processing 
times in an optimal solution. 

Theorem 4.2 (Integrality Gap). There are instances in which Opt = O(logP) • Opt(LP). 

Proof. Consider an instance where there are k + 1 large jobs numbered 1, 2, . . . , A: + 1. Jobs 1, 2, . . . , fc 
have processing times 2^, 2'^, . . . , 2*^+^ respectively and job k + 1 has a processing time of 2^~^^. In 
addition, there are M = M{k) small jobs of unit processing time, where M is a parameter whose value 
will be determined later. Large jobs 1,2, ... ,k arrive in decreasing order of processing time, where job 



j arrives at the beginning of the white block numbered j in [Figure ij . White block j occupies 2^ time 
units. Job k + 1 arrives at the beginning at the white block numbered A; + 1 which occupies 2^'^^ time 
units. There is also a large grey block occupying M time units; the arrivals of small jobs are uniformly 
spaced in this block (starting at the left endpoint) with a gap of 1. Now suppose we are required to 
schedule M + k/2 + l jobs. 



2' 


2'-' 


2^-2 


2- 


,1/ 2l 


2'+l 














k 


k - 1 


k - 2 










k + 1 



Figure 1: A schematic description of the integrality gap instance. 

We first observe that the optimal schedule picks every small job (as well as k/2 + 1 large jobs). To see 
why, suppose one or more small jobs have not been picked, let q be the minimal index of such a job. 
Note that once we pick a subset of M + k/2 + 1 jobs, an optimal schedule is determined by employing 
the shortest remaining processing time rule (see, for example, [^). Further, from the sizes of the white 
blocks, we see that even if a large job is scheduled without being preempted since its release date, it 
would have a remaining processing time of at least 2 at the beginning of the grey block. Therefore, 
from the SRPT rule, it is clear that the first q — 1 small jobs are scheduled in the first q — 1 time units 
of the grey block. Thus, at the point when job q is released, any large job has a remaining processing 
time of at least 2. It follows that, by picking q and dropping some large job, we can obtain a smaller 
flow time, implying that the schedule under consideration cannot be optimal. 

Further, it is optimal to schedule large job k+l. If there is a solution which does not, we can schedule it 
while skipping one additional large job out of jobs 1,2, ... ,k to improve on the average flow time (this 
holds if M > 2'=+!, which we will ensure later). Therefore, the value of k* in the LP (which denotes the 
largest class scheduled in an optimal solution) will be A; + 1. 

Based on the above SRPT observation, we can conclude that each small job will be contiguously 
processed to completion immediately after its release date, and as a result no large job (from the set of 
jobs 1,2, ... ,k) can be completed any sooner than the right endpoint of the grey block. Hence, every 
large job picked incurs a flow time of at least M, meaning that Opt > Mk/2. On the other hand, a 
fractional solution can fully schedule every small job as soon as it arrives, and schedule half of each 
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large job j into white block j. It can also schedule large job k + 1 completely into white block k + 1. 
It is not difficult to verify that the cost of this solution is at most M + Y^']=i 2^' + 2''+^ < M + 2'=+^. 
Therefore, by setting M = 2^~^^, we have 

Opt Mk 2''+^k k 



Opt(LP) - 2(M + 2'=+2) 2(2^+1 + 2^+2) 6 

where the last equality holds since P = 2'''^^. ■ 

Note that this gap instance is on a single machine, for which we know that the shortest remaining 
processing time policy (SRPT) is optimal in the non-outlier case. However our results eventually show 
that this is as bad as it gets — we show an upper bound of 0(log P) for the integrality gap even for 
identical machines! 

4.2 The Flow-time Rounding Algorithm: General Game Plan and Some Hurdles 

Before we present our algorithm in detail, let us give a high-level picture and indicate some of the 



complicating factors over the earlier work. Previous LP-based rounding techniques |13| , 14] relied on 
the fact that if we rearrange the jobs of length roughly 2*^ — call such jobs "class-fe" jobs — among the 
time slots they occupy in the fractional solution, the objective function does not change much; these 
algorithms then use this rearrangement to make the schedule feasible (no job simultaneously scheduled 
on two machines) and even non-migratory across machines. We are currently considering the single 
machine case, so these issues are irrelevant for the time being (and we will come back to them later) — 
however, we need to handle jobs that are fractionally picked by the LP. In particular, we need to swap 
"mass" between jobs to pick an integral number of jobs to schedule. And it is this step which increases 
the LP cost even in the case of a single machine. Note that we essentially care only about the yj value 
for each job j, which indicates the extent to which this job is scheduled — if we could make them integral 
without altering the objective by much, we would be done! 

However, naive approaches to make the yj's integral may have bad approximation guarantees. E.g., 
consider taking two consecutive fractional jobs j and j' with similar processing times (observe that jobs 
with similar processing times have similar contributions to the objective, except for the release date 
component) and scheduling more of the first one over the second. If the second job j' has even slightly 
smaller processing time than j has, we would run out of space trying to schedule an equal fraction of 
j over j' , and this loss may hurt us in the (hard) profit requirement. In such a case, we could try to 
schedule j' over j, observing that the later job j' would not advance too much in time, since j and j' 
were consecutive in that class and have similar processing times — the eventual hope being that given 
a small violation of the release dates, we may be able to shift the entire schedule by a bit and regain 
feasibility. 

But this strategy could lead to arbitrarily bad approximations: we could keep fractionally growing a job 
j until (say) 2/3 of it is scheduled, only to meet a job j' subsequently that also has 2/3 of it scheduled, 
but j' has smaller processing time and therefore needs to be scheduled over j. In this case, j would 
shrink to 1/3, and then would start growing again — and repeated occurrences of this might cause the 
flow time for j to be very high. Indeed, trying to avoid such situations leads us to our algorithm, where 
we look at a window of jobs and select an appropriate one to schedule, rather than greedily running a 
swapping process. To analyze our algorithm, we charge the total increase in the fractional flow time 
to the fractional makespan of the LP solution, and show that each class of jobs charges the fractional 
makespan at most twice. 

4.3 Notation and Preliminaries 

We partition the collection of jobs into classes, with jobs in class Ck having pj € (2^~^, 2^"']. Notice that 
Pj = 2^ for every j G C^, and the class of interest with highest index is C^*. Given a fractional solution 
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(x, y, /), we say that job j is fully scheduled if yj = 1, and dropped if yj = 0; in both cases, j is integrally 
scheduled. Let flow(x, y, f) = Yl^=i fj fractional cost; note that this is not the same as the actual 

flow time given by this solution, but rather an approximation. Let ■P(x, y, /) = X]"=i X^^o ^i* 
total fractional processing time. Since each job j gets xjt amount of processing time in [t,t + 1), the 
cost of {x, y, f) remains unchanged if all jobs are processed during the first part [t, t + X]j=i ^jt) ^^^^ 
unit interval; we therefore refer to \t + X]"=i Xjt, t + 1) as the free time interval in [t, t + 1). 

We say that an LP solution (x, y, /) is "non-alternating" across each class if the fractional schedule does 
not alternate between two jobs of the same class. Formally, the schedule is "non-alternating" if for class 
k and any two class-/c jobs j and j', if yj,yj' > and rj < rji (or rj = rj' and j < j'), then for any 
times t, t' such that Xjt > and xj/^/ > 0, it holds that t < t' . We call a solution "packed" if there is no 
free time between the release date of a job, and the last time it is scheduled by the LP solution. The 



following lemma is proved in [Appendix A.l| ; we assume that we start off with such a solution. 
Lemma 4.3. There is an optimal LP solution {x* ,y* , f*) that is non- alternating and packed. 

4.4 The Flow-Time Rounding Algorithm 

At a high level, the rounding algorithm proceeds in two stages. 

• In Stage I, for each k, we completely schedule almost as many class-A; jobs as the LP does frac- 
tionally (up to an additive two jobs). The main challenge, as sketched above, is to do this with 
only a small change in the fractional flow time and the processing time of these jobs. 

• In Stage II, we add in at most two class-fc jobs to compensate for the loss of jobs in Stage I. Since 
we add only two jobs per class, we can show that the additional flow time can be controlled. 

4.4.1 Flow-Time Rounding: Stage I 

Recall that we want to convert the non-alternating and packed optimal solution {x*,y*, /*) returned by 
the LP into a new solution (x' , y', f) where at least LSjgCfcylJ ~ ^ class-A; jobs are completely scheduled. 
The algorithm operates on the classes one by one. For each class, it performs a swapping phase where 
mass is shifted between jobs in this class (potentially violating release dates), and then does a shifting 
phase to handle all the release-date violations. 

Swapping Phase for Class-fc. Given the non-alternating and packed solution {x* ,y* , /*), we run the 
algorithm for the swapping phase given in Algorithm 2. 

Shifting Phase for Class-Zc. After the above swapping phase for class-A jobs, we perform a shifting 
phase to handle any violated release dates. Specifically, consider the collection of time intervals occupied 
either by class-A; jobs or by free time — by the process given above, this remains fixed over the execution 
of the swapping phase. We now shift all class-A jobs to the right by 2 • 2^ within these intervals. Of 
course, we need to prove that this takes care of all release date violations. 

4.4.2 Analysis for Stage I 

Lemma 4.4. The following properties hold true at the end of Stage I: 
(t) Vix',y',f')<2Vix*,y*,f*) 

(a) The fractional flow time satisfies f\ow{x' ,y' , f) < 4 ■ f\ow{x* ,y* , f*) + 6k*T'{x* ,y* , f*). 

(Hi) The sum of flow times over all fully scheduled jobs is at most 2- f\ow(x' ,y' , F') + k*'P{x' ,y' , F'). 

The analysis proceeds by a somewhat delicate charging argument and the basic idea is the following. 
In [Step 4 of the algorithm, suppose A fraction of a job ji is being scheduled over A fraction of a job 



jfc: we will charge every point in the interval {rjj^,rj^) by an amount A. In the case when a A fraction 
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Algorithm 2 Class-/c Swapping 



1: set {x',y',f') := {x* ,y* , f*). Repeat the steps |2[j5| until LSjgc^J/jJ — 1 class-/c jobs are completely 
scheduled in (x',y',f'). 

2: advance all class-fc jobs as much as possible without violating release dates (for jobs already vio- 
lating release dates, don't advance their starting time any further) within the time intervals that 
are either free or are occupied by class-A; jobs. 

3: let ji the first fractionally scheduled job in the current LP solution {x',y',f'). Let jg+i be the 
first class-A: job scheduled after ji which has processing time Pj^+i < pji, and say the class-/c jobs 
that are scheduled between ji and j^+i are j2,j3, ■ ■ ■ ,jq- Note that all these jobs must have greater 
processing time than . Also, let free denote the total free time between ji and jg+i in the current 
schedule. 

4: if Ylk=2y'k + ^'''^^/Pji ^ 1 ~ yjij then we know that ji can be completely scheduled over the jobs 
J2, is, ■ ■ ■ ijq and the free time; for k = 2 to q, do the following 

4a: if there is some free time (of total length, say, L) between jk-i and jk, schedule a fraction 
A = min(l — y^-^ , L/pj^ ) of ji in the free time, and delete a fraction A from class-A; jobs at the 
rear end of the schedule. Update {x',y',f'). 
4b: schedule a fraction A = min(l — y'j-^^u'jj^ of j'l over a fraction A of job jk (possibly creating 

some free space). Update {x',y',f'). 
4c: if A: = g and there is some free time (of total length, say, L) between jg and jq+i, schedule a 
fraction A = min(l — y'-_^,L/pj^) of ji in the free time, and delete a fraction A from class-A; 
jobs at the rear end of the schedule. Update {x',y',f'). 
5: else if ^21^2 y'k + ^^^/Pii < 1 — u'j^-, do the following 

• delete a total fraction min(^^^-^ y^iy'j^^-^) from a prefix of jobs ji, j2, • • • and advance the 
current fractional schedule of the job jg+i to occupy the space created. Update the solution 
{x',y',f'). Note that it may or may not have been possible to schedule ji in the space 
fractionally occupied by jobs j2,j3, ■ ■ ■ ,jq and free time in this interval; for accounting reasons 
we do the same thing in both cases. 



of ji is being scheduled over an interval of free time beginning at t, we will then charge every point in 
the interval (r^^ , t) by the fraction A. We then go on to show that flow(x', y' , /') — flow(x*, y*, /*) is not 
too much more than the total charge accumulated by the interval [0, T] (recall that T is the last time 
at which the LP scheduled some fractional job). To complete the proof, we argue that the total charge 
accumulated is 0{log P)V{x* ,y* , f*). 

In Appendix A.2| we restate Stage 1 in a slightly different way, where we also define the charging process 



charge associated with each step of the algorithm, and give the complete proof of Lemma 4.4 . 
4.4.3 Flow-Time Rounding: Stage II 

The fractional solution (x', y' , F') may not be feasible, since we have only scheduled LXljeCfc y*j\ ~ ^ 
from class-A;. Hence, for each class-A;, arbitrarily pick the minimum number of non-fully-scheduled jobs 
to bring this number to lYlj^Ck most two per class). These jobs are preemptively scheduled as 

soon as possible after their release date. Since at most two jobs per class are added, the flow time does 
not change much. 

Lemma 4.5. The sum of the flow times of all added jobs is at most k*{'P{x', y', F') + 2^^' ~^^). 

Proof. For a class-A;, we may have to complete two additional jobs. When we schedule an extra job as 
soon as possible, it waits only for jobs that were fully scheduled during stage 11 or for jobs that were 
added in previous iterations of the current stage. Therefore, its flow time can be at most 'P{x' , y' , F') + 
2 Ylk*=i and therefore the total flow time of added jobs is at most k*{'P{x',y', F') + 2^*+"^). ■ 
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Because fj is lowerbounded by x*j/2, we have that V{x* ,y*, /*) < 2 • Opt. Therefore, Lemmas 



and ^ in conjunction with the inequahties V{x' ,y' , F') < 2V{x* ,y* , f*) < 4 • Opt and k* < log P + 1, 
prove the following result for minimizing flow time on a single machine. 

Theorem 4.6. The problem of minimizing flow time on a single machine with unit profits can be 
approximated within a factor of O {log P). 

4.5 Flow-Time: Identical Parallel Machines 



We conclude this section by showing how to combine our single machine algorithm along with ideas 
drawn from to obtain an 0(logi-*) approximation for the case of identical machines. We begin by 
solving a natural extension of the single machine LP to the setting of identical machines; let (x*,y*, /*) 
be the resulting fractional solution. 

• Stage I: We rearrange the jobs to make the schedule non-migratory, while preserving the fraction 
to which each job has been scheduled. This modification is done by performing the procedure 



given in |13| with the only change being that the jobs in {x* ,y* , f*) are fractionally scheduled. 
The resulting solution {x,y,f) can be shown to have an LP cost of at most flow(a;*, y*, /*) + 
0{logP)V{x*,y*J*). 

Stage II: For each machine, we execute Stage I of the single machine algorithm. As a result, al- 
most all jobs are now integrally scheduled, leaving at most two fractionally scheduled jobs per class 
and machine, without increasing the LP cost by much. If {x',y', /') denotes the LP solution after 
this stage, we can show that flow(x', y', f')<2- i\o\i\/{x* ,y* , f*) + 61og P ■ V{x, y, f) (The analysis 
is identical to that for Stage I of the single machine algorithm, presented in [Appendix 4.4. 1| ) . 
Stage III: We consider all fractionally scheduled class-fc jobs (there are at most 2 per machine) 
and schedule more of the job with least processing time, while deleting an equal fraction from the 
one with largest processing time, until either the small job is fully scheduled or the large job has 
been completely dropped. This procedure is repeated till only at most one fractional class-fc job 
remains. The entire process is repeated for each class. In Appendix A.3| , we show that the LP cost 



satisfies flow(x,y,/) < flow(x', y', /') + 21ogP • V{x' ,y' , f) where {x,y,f) denotes the solution 
after Stage III. Subsequently, bounding the actual sum of flow times (of the integrally scheduled 



jobs) also follows closely to Lemma A. 6. 



• Stage IV: As in the single machine case, we schedule the one remaining fractional job for each 
class by adding processing time whenever possible. This change does not significantly increase the 
solution cost much like [Lemma 4.5| . 

The algorithm and the proofs depend heavily on the single machine case; we give more details in 



Appendix A. 3 . To conclude, we have the following theorem. 

Theorem 4.7. The problem of minimizing flow time on identical machines with unit profits can be 
approximated within a factor of O {log P). 
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A Proofs from Section |^ 

A.l Proof of Lemma |4.3|: Non- Alternating and Compact Optimal Solutions 



Consider doing the following changes for every class 1 < k < k* in some arbitrary order. 

1. Let T{k) be the union of all time intervals in which the LP solution {x* ,y* , f*) schedules class-fc 
jobs along with the overall free time. 

2. Use T{k) to continuously schedule a y* fraction of each job j € Ck- These fractions are scheduled 
in increasing order of release dates as soon as possible (while respecting release dates). 

Let (x, y, f) be the resulting LP solution after we finish this operation. Notice that this solution is non- 
alternating and packed. Also, every job is still scheduled to the same extent as before, i.e. yj = y*, and 
consequently, we have, V(x, y, f) = V{x* , y*, F*). It remains to show that flow(x, y, f) < f\ovj{x* ,y* , /*). 
Let Vk,t{x,y, f) = J2jeCk Ylt'=t^jt' be the overall processing time of class-A; jobs after time t. Clearly 
since we are only rearranging class-A; jobs (and subsequently advancing jobs whenever possible) within 
the time intervals in which they were scheduled in {x* ,y*, /*), we have Vk^t{x, y, f) < Vk^ti^* ,y* , /*) for 
every k and t. Now, 



flow(x, y, /) - flow(x*, y*, /*) = Y.[fj- f* 

^EEE((|(-i-.^f)-(f 

k=l j^Ck t=o ^^^^ ^ ^ ^ ^'■^ 

k* T 



*+2 



1 \ X 



t{Xjt Xjf) 



k=i jeCk t=o 
k* ^ T 

k=i t=o jeCk 

k* ^ T 

= E ^ E y> /) - yk,t{x*, y*,n) < o 

k=l t=0 

The second equality holds since X^^^q ^jt ~ St=o ^jt every j. The last equality holds since 
ELo * EjGCfe ^jt = ELo yk,tix, y, f) and ^Lo * EjeCfe ^*jt = J2j=o yk,tix*,y*, /*). 



A. 2 Proof of Lemma |4.4| : The Stage I Algorithm 

The first part is simple to prove. Observe that the only step where the Stage 1 algorithm would schedule 
a larger job while deleting a smaller job is in Step 4a (or Step 4c). However, even in this case, since it 
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operates within a particular class, the worst case would be scheduling A fraction of a job j in some free 
time, while deleting A fraction of a job j' whose processing time is half that of j. Therefore, it follows 
thatV{x',y',f')<2V{x*,y*,f*). 

For the proof of the next two parts, we describe an equivalent form of the swapping algorithm which 
operates on each time slot [t, t + 1) one by one, rather than job by job. As the algorithm proceeds, the 
solution (x',y',/') keeps getting updated: initially (x',y',/') = (x*,y*,/*). We introduce a charging 
scheme charge(t, j) which is initially set to for every t G [0,T] and j € {1, . . . ,n}. It is progressively 
modified over the course of the algorithm, and helps us bound the increase in LP cost as the jobs are 
make integral. 

Revisiting the Swapping phase for class-A;. If the number of fully scheduled class-A; jobs in the 
current solution is smaller than [XljeCfe ^jJ ~ ^' ^^^^ advance all class-/c jobs as much as possible 
within the union of class-fc time intervals along with the overall free time. That is, we make sure that 
there is no free time between the release date of any fractionally scheduled class-fc job and the last time 
interval in which some fraction of it is scheduled. Note that this does not cause any increase in cost of 
the LP solution. 

Now consider some stage of the swapping stage where ji be the first fractionally scheduled class-fc job, 
and let j2, ■ ■ ■ ,jg be a prefix of the class-A; jobs scheduled after ji defined thus: q is the minimal index 
for which Pj^^^ < Pj^ or for which jg is the last scheduled class-A; job (see Figure 2 ). For any s < q, let 
free(js, js+i) be the overall amount of free time between the last interval in which js is scheduled and 
the first interval in which j^+i is scheduled^. There are two cases to consider: 

I I Class I: jobs | | Free Time ^^^^ Other class jobs 









.il 


;1 


'I 





Case I: 

scheduled: 



Figure 2: A prefix of class-fc jobs (schematic illustration). 
+ Yll=i^'^^^Us,js+i)/Pji > ^ ~ y'ji' t^i^ scenario, repeat until ji becomes fully 



J2 



0, let 



+ 1. 



1. Let s G {2, ■ ■ ■ ,q} be the minimal index for which y'j^ > 0. If y'j 

2. If free(ji,js) = 0, let [t,t + 1) be the first time slot where js is scheduled. We now replace a 
At = min{l — yjj^,x'j^t/pj^} fraction of js by a At fraction of ji, possibly creating some free time. 
We also advance all class-fc jobs starting from jg as much as possible (without violating release 
dates) within the union of all time intervals in which these jobs are scheduled along with the 



overall free time. An illustration of this step (before the advancement) is shown in Figure 3 



Figure 3: Example of Case I. Notice the free time created due to pj^ being smaller than pj^ . 



Charging: When a At fraction of js is replaced by a A^ fraction of ji , we say that each point of the time interval 
{rj,,rjj pays Atpjjpj^ on behalf of ji. I.e., set charge(i', ji) ^ charge(i', ji) + AtPjJpj, for all t' e {rj^^r^J. 

^If jg is the last scheduled class-fc job, we define free{jq, jg+i) = oo. 
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Cost increment : When we replace js with ji , the extra LP cost paid by ji is + 5 ^ ^ji ) + ^*2^^ whereas 

the cost saved by removing a fraction of js is + 5 ~ ^is ) + '^'2^° • The cost increase is 



which is exactly the increment in X^jgCi, charge(t, The inequality above holds because pj^ > pj-^. 

3. Otherwise (i.e., free(ji,js) > 0) let / be the first free time interval between the last interval in 
which ji is scheduled and the first interval in which jg is scheduled. Also, let [t, t + 1) be the first 
time slot having a non-empty intersection with /, and let ji be the last fractionally scheduled class- 
k job. Note that j/ cannot be any of the jobs j2, ■ ■ ■ ,jq+i, as the number of fully scheduled class-A: 
jobs is at most LSjeCj; ^jJ ~ ^- Now, we schedule an extra At = min{l — y'j^ , y^-^ , |/n [t,t + 
fraction of ji in In [t,t + 1), while continuously deleting a At fraction of ji from the intervals 
where this job is scheduled, in reverse order of time. 

Charging: If rj-^ < t, each point of the time interval {rj^,t) pays AtPjilPji on behalf of ji. That is, we set 
charge(t', ji) ^ charge(i', ji) + AtPjJpj^ for every t' e {rj,,t). 

Cost increment : The extra cost paid by ji is ^-^'^ + 5 ~ '"ji ) + ' '^hile the cost saved by deleting a At 
fraction of is at least AtPjj2. Since ji and ji belong to the same class, the cost increment is at most 

^(t-r,J + 3.^. 



Pii 



When < t, the first term is exactly the increment in ^^^^^ charge{t, j)dt as a result of setting 
charge(i', ji) <— charge(i', ji) + Atpj^/pj^ for every t' e {rj^,t). In the opposite case, this term is negative, 
which is why we do not need to modify the charging function. In addition, the term AtPjj2 lower bounds the 
contribution of the deleted fraction of ji towards the quantity flow(a;*, y*, /*). 

Case II: Yl'i=2yjs + Ss=i < ^ ~ Uji- this case, repeat until jg+i becomes fully 

scheduled or until y'- =■■■=■«' =0: 

1. Let s € {1, . . . ,q} be the minimal index for which y'j^ > 0. 

2. Let [t,t + 1) be the first time slot where js is scheduled. We now replace a = min{l — 
y'jq+i^^'jat/Pjs} fraction oi js by a A^ fraction of jg+i, possibly creating some free time. 

Cost increment : The extra cost paid by jg+i is -'^''^^ (^+5 ^^i^+i) ^ whereas the cost saved by 

scheduling a smaller fraction of js is ^" 5 ~ ^js) + Since ps > pj^ > , the cost increment is 

and there is no need to modify the charging function. 

Conclude this case by making the following rearrangements: 

1. Continuously schedule a y'j^^-^ fraction of jg+i within the union of class-A; intervals and free time, 
starting at the first interval in which jq^i is currently processed. Even though we may have 
violated the release date of jg+i, the earliest time in which any part of this job is processed was 
advanced by at most 2 • 2^ within the union of time intervals where class-fc jobs are scheduled 
and free time intervals, since we initially had Yll=2yjs + Ylt=i^'''^^Us, js+i)/Pji < 1 — y'j^- This 
anomaly will be handled in the sequel. 
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2. Following jq+i, proceed by scheduling a yj_^, . . . , y'j^ fraction of ji, . . . , jg, respectively, as soon as 
possible (without violating release dates) within the time intervals where class-/c jobs are scheduled 
and free time. 

We are now ready to prove Lemma Noting that the fractional contribution of each class changes 
only during the course of its corresponding iteration, we may focus our attention on a fixed class-fc, and 
bound its fractional cost, flowfc(x',y', /') = X^jgCfc ^j- 

Claim A.l. Just before the shifting phase for class-k, we have 

flowfc(x',2/',/') < 4-flowfe(x*,y*,r) + T Y] chavge{t, j)dt . 

Proof. We can bound the cost increment of each operation in the swapping phase as follows: 

• A single operation in case I, [step 2| : As mentioned in the algorithm, the cost increment is upper 
bounded by the increment in ^j^Ck charge(t, 

• A single operation in case I, ^tep 3| : In these settings, the cost increment can be bounded by the 
increment in j'^ ^■^^^c\\arge{t, j)dt plus thrice whatever the deleted fraction of j; contributes 
to flow(x*, y*, /*). It is important to observe that deleted fractions will not be used later on to 
bound additional cost increments for subsequent operations for this stage. 

• Operations in case II: Because we are only rearranging jobs within class-/c space (and never 



introduce any free time), arguments similar to the proof in Appendix A.l show that no extra cost 
is incurred in this step. 

Therefore, at the completion of the swapping phase, we have 

[■T 

flowfc(x',y',/0-flowfc(x*,y*,r) < / V charge(t, j>i + 3 • flowfe(x*, y*, /* ) . 
We proceed by establishing a few crucial properties of the charging function. 

Claim A. 2. Just before the shifting phase, no free time ever pays on behalf of any job. In other words, 
if^j^C^ charge(t,j) > then t cannot be free time. 

Proof. We prove the above claim by arguing that, whenever an interval is charged, each of its points is 
currently dedicated to processing some job. It is not difficult to verify that our algorithm preserves this 
property till the swapping phase terminates. Consider an operation where some interval is charged. 



• In case I, ^tep 2[ suppose that a fraction of js is replaced by a fraction of ji. Then, the 
charging scheme increases charge(t', ji) by Atpj^^/pj-^ for every t' £ ij'j^,rj^). However, we are 
guaranteed not to have free time in the interval (r-,-^ , rj^ ) because of the fact that there is no free 
time between and the last interval in which ji is scheduled (any such free time is eliminated 
when we begin the swapping phase for class-fc), and because free(ji,js) = 0. 



In case I, [step 5 , the algorithm picks the first time slot (say, + 1)) which has some free time 



between the last interval in which j'l is scheduled and the first interval in which js is scheduled. 
Suppose that an extra A^^ fraction of ji is scheduled, increasing charge(t', ji) by IS.t^pj^/pj^ for 
every t' S (rj^,ti). Note that there cannot be free time between the last interval in which ji is 
scheduled and ti (by the way [ti, ii + 1) was picked), and also between r^^ and the last interval in 
which j'l is scheduled. Therefore, there is no free time in (rj^,ii). 
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Claim A. 3. Each point in time pays on behalf of at most one job. That is, for every t G [0, T], 



|{jGCfc:charge(t,i)>0}|<l . 

Proof. We prove the above claim by contradiction. For this purpose, suppose there exists a point in 
time t* G [0, T] that pays on behalf of two jobs, say j and j'. By the way we charged jobs in the 
swapping phase, we know that both j and j' must be fully scheduled. Without loss of generality, we 
assume that j appears before j' in the schedule {x*,y*, Consider a single operation in which t* is 
charged, paying some amount of behalf of j. 



Suppose step 2 is executed in time slot [t, t + 1), where fraction of a job js is replaced by 
fraction of j during which t* pays on behalf of j. Since t* is charged in this operation, it follows 
that t* € (rj, rjj. In addition, we have rj^ < rji, or otherwise j' must have been fully replaced by 
j during previous operations, since j is currently replacing jg. Therefore, t* < rj^ < rji, implying 
that t* cannot be paying on behalf of j' , since our charging scheme guarantees that a time point 
can pay on behalf of a particular job only when it appears after the release date of this job. 



On the other hand, suppose [step 3 is executed in time slot [t, t + 1), where a Af fraction of j is 



scheduled. Since t* is charged, it follows that t* £ {rj,t). Now, if t* pays on behalf of j', we must 
have rj' < t* < t, meaning that at the moment there is some free time between r^/ and the first 
interval in which j' is scheduled. Such free time would have been eliminated at the beginning of 
this iteration as a result of advancing class-/c jobs. 



Claim A.4. Ej^c, charge(t,i)dt < 2r{x* ,y* , f*). 



Proof. By [Claim Al3 , we know that each point in time pays on behalf of at most one job per class. 



Also, whenever a point t is charged on behalf of a job j, the increment in charge(t,j) is of the form 
Apj/pj, where A is the additional fraction of j being scheduled. Therefore, the total amount t can pay 
on behalf of j is at most Pj/pj < 1- This bound, coupled with the observation that free time never pays 
on behalf of any job (see piaim AT^ ), proves that 



rT 

/ V charge(t,j)di < V{x',y',f) < 2V{x*,y*J*) 



The last inequality holds since, throughout stage II, the overall processing time cannot grow by a factor 
greater than 2: Whenever we schedule an extra fraction of some class-A; job, we also delete an equal 
fraction from some other class-fc job, which in the worst case has half its processing time. ■ 

Claim A. 5. Immediately after the shifting phase, we have 

flowfc(x',y',/') <4.flowfe(x*,y*,r) + 6P(x*,y*,r) . 



Proof. By combining [Claims A.l| and A.4[ , we can bound the fractional cost of class-A; just after the 
swapping phase by 

flowfc(x',y',/') < 4-flowfc(x*,y*,r)+ charge{t, j)dt 

<4■f\o^Nk{x*,y*J*) + 2V{x*,y*,f*) . 



In addition, arguments nearly identical to those of Garg and Kumar |14, Clm. 4.3] show that the cost 



increment due to the shifting phase is at most 2V{x' , y', f) < 4'P(x*, y*, /*). 



This assumption implies r-j < Vji because the LP solution is assumed to be non-alternating. 
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Part (ii) of Lemma 4.4 is now derived by summing the inequality stated in Claim A. 5 over all classes: 



flow(x',y',/') = Y,^\o^Nk{x',y'J') 



k=l 

k* 



< 4 flowfc(x*, y*, /*) + 6k*V{x*, y\ f*) 

k=l 



The third part of Lemma 4.4 follows from the next lemma. 



Lemma A. 6. The sum of flow times of all integrally scheduled jobs is at most 2 ■ flow(x', y', /') + 
k*V{x',y',f'). 

Proof. Consider some fully scheduled job j. It is easy to verify (see [^] for a proof) that the quantity 
2Fj is at least the actual flow time of j minus the amount of time for which j has been preempted 
(which cannot include free time). In addition, our algorithm ensures that, at any point in time, at most 
one class-/c job may be preempted. Hence, by summing over all fully scheduled class k jobs, it follows 
that their sum of flow times is bounded by 2 J2jeCk ^(^'j y'> /') = ^ • flowfc(x', y', f) + V{x' , y' , /'). 
The desired result is obtained by summing over all classes. ■ 



A. 3 Parallel Machines Algorithm 

For completeness, we first provide the natural extension of the flow time LP for identical machines. 
Then, we present the algorithm in more detail. 



minimize X]j=i fj 

subject to (1) /.=i:E(|f + V, 

(2) pjyj = Xlt^o Elli Xijt Vj 

(3) Yl]=i^ijt<'^ 

(4) E-=i% >n 

(5) Xijt = yi,j,t:t<rj 

(6) Xijt>0, <yj <1 yi,j,t 



The Algorithm: 

Stage I: Let (x*, y* , /*) be an optimal LP solution. We first rearrange the jobs following the procedure 
given in |jl^ to make each job's schedule non-migratory whilst preserving the fraction to which it has 
been scheduled. Let (x, y, f) be the updated solution. A proof identical to Lemma 3.3 in [|l3| shows 
thatflow(S,y,7) <flow(x*,2/*,r)+0(logP)P(x*,y*,r) and that P(S, y, /) < P(x*, y*, /*). 

Stage II: Let flow^x, y, /) of an LP solution (x, y, /) be ELo (if + ^ " ^i) + ^) ' and V'{x, y, /) = 

'YlijYlit=Q^ijt- For each machine, run Stage I of the single machine algorithm: let {x',y',f') be 
the (possibly infeasible) solution obtained. From the analysis of the single machine case, we have 
flow*(x',y',/') < 2 • flow*(x,y,7) + 6(logP)P*(x,y,/), and P*(x', y', /') < 2P*(x,y,/). Further, for 
each k, the number of class-k jobs completely scheduled on any machine i in (x',y',/') is at least the 
fractional number of class-/c jobs scheduled on machine i by (x*,y*,/*) (up to an additive 2 jobs). 
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Stage III: We now handle the infeasibihty of {x',y', /'): each class may still have up to 2 fractionally 
scheduled jobs on each machine. We set {x,y,f) := {x',y',f'), and make changes to {x,y,f). Like in 
the single machine case, we swap jobs to make them integrally scheduled. For each k, 

Ilia: Advance all class-A: jobs as much as possible (within time occupied by class-A: jobs and free time) 
such that there is no free time between when a job is released and when it is scheduled in (x, y, /). 

Illb: Repeat the following until there is at most 1 fractionally scheduled class-A; job in (x,y, /): 

Let ji be the fractionally scheduled class-A: job with largest processing time and j2 be the one 
with smallest processing time. Keep adding j2 to the end of the schedule on the machine in which 
it has currently been scheduled, while deleting an equal fraction from ji until either (i) j2 is fully 
scheduled, or (ii) ji has been completely deleted. 



Analysis. Suppose the algorithm is replacing ji (scheduled on machine ii) with j2 (scheduled on 
machine 12)- Instead of performing the replacement in one shot, we could also do it in a time slot by 
time slot basis. Let the last time interval in which ji is scheduled be [ti,ti + 1), and let [t2,t2 + 1) be 
first interval that has free time, after the fractional completion of j2. The algorithm deletes a fraction 
A = mm{xi-^jj^t/Pjn 1 — yj2' (1 ~ 'Ylij^i^jt^) /Pj2) of ji and schedules A fraction of 72 in the free time 
in \t2,t2 + 1) on machine «2- Intuitively, A is the minimum of the fraction of ji that is scheduled in 
+ 1), the fraction of j2 needed to make it fully scheduled, and the fraction of j2 that can be 
scheduled in the free time in \t2,t2 + 

Observe that because pj^ < pj^ , the additional cost incurred by the modified LP solution is at most 
(*2 - rj, + 1)A + ^ - (ti - rj, + i)A + ^ < (^2 - r.-JA* 

Also notice that every point in the interval (^^2,^2) is not free time, by the way t2 was chosen. 

We then employ a charging scheme where each point t on the time interval {rj^,t2) pays an additional 
charge of A towards job j2. The following properties are then true at the end of this stage: 

(a) Each point pays at most 2 on behalf of jobs belonging to a class on each machine. This is because 
there can be at most 2 fractional jobs per class on each machine in the solution {x',y',f') and a 
point in time pays only for a fractional job which becomes completely scheduled. 

(b) Any point which pays on behalf of a job cannot be "free time". This follows because we are 
guaranteed that there is no free time in any charging interval. 

(c) The total processing time in the LP solution does not increase: V{x,y,f) < V{x' ,y' , f). This 
holds because we always replace a fraction of a larger job with an equal fraction of a smaller job. 

Therefore, at the end of this stage, the cost of the updated LP solution (x, y, /) is bounded by 

flow(£, y, J) < flow(x', y', /') + 2(log P)V{x',y', f) 
and the total processing time by 

r{x,yj)<r{x',y',f') . 



Stage IV: After Stage III, we might still have at most one fractional job per class. To handle this, for 
each class-A;, we completely schedule the last remaining fractionally scheduled class-A; whenever possible 
on the machine in which it has been fractionally scheduled. The analysis for this step, and the one for 
bounding the actual sum of flow times of the integrally scheduled jobs is analogous to the one for the 
single machine case. This proves Theorem 4.7 . 
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B Average Weighted Completion Time 



B.l Weighted Completion Time with K Profit Constraints 



We now consider an extension of the problem studied in pection 3| : one in which there are K different 
profit requirements of the form T^jVj > H'^ for 1 < k < K. We highlight the changes to be made to 
our algorithm and then present its analysis. 

Necessary modifications: 

(a) KG constraints for each profit requirement are written down in the LP. Analogous to the single 
profit requirement case, we define vr^' = min{7r^,n'^ — n'^(^)} for each subset of jobs A and 



l<k<K. 

(b) We set A* = {j : yj > 1/(3k}- That is, instead of rounding up each yj by a factor of 2, we round 
these variables by a factor of Pk, a parameter whose value will be determined later. 

(c) For jobs in A*, we mark each machine/time pair tj = {i,t) with probability Xijt/{pijyj). For jobs 
not in A*, we mark each machine/time pair tj = {i,t) with probability PKXijt/pij- Essentially, 
we pick jobs in A* with probability 1, and every other job with probability PrVj- 

Analysis. The proof that the expected cost is within a factor of 0{Pk) of optimal is nearly identical 
to that of the single profit requirement case. Therefore, we would like to fix (3k such that all profit 
constraints are simultaneously satisfied with constant' probability. To this end, consider one such profit 
requirement H^. We upper bound the probability that the collection of jobs picked does not satisfy this 
requirement. Consider the knapsack cover inequality for A* with respect to requirement k, stating that 



E 
z 



■HA* 



yj > n — Yl {A*). The total profit collected from jobs not in A* can be lower bounded by 
Zj] here, each Zj is a random variable indicating whether job j is picked. To provide 

n^(^*), note that 



an upper bound on the probability that Z falls below n'^ 



k,A* - 



Consequently, let us define aj = ttj' /(H'^ — n'^(^*)). Since our algorithm independently picks each 
job not in A* with probability (3x1/ j, we have 



Pr [Z < U'' -U''{A*)] = Pr 



< exp 



< Pr 



/3 a- 



where the first and third inequalities hold since E[^j.^_^, '^j^j] ^ Pk, said the second inequality follows 
from the Chernoff-type bound on the lower tail of the sum of independent [0, 1] r.v.s (see, e.g., |||, Thm. 
3.5]). 

We then fix Pk such that exp(— (/?/^ — l)^/2/?i^) is at most 1/lOK (it suffices for Pk to be 0{logK) 
for this to hold). Consequently, by the union bound, the probability that some profit constraint will 
not be satisfied is at most 1/10. It follows that our randomized algorithm computes a schedule whose 
expected cost is 0(/3A')0pt, and all the profit constraints are met with 0(1) probability. 
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